A New Genome-Wide Method to Track Horizontally Transferred Sequences: Application to Drosophila
نویسندگان
چکیده
Because of methodological breakthroughs and the availability of an increasing amount of whole-genome sequence data, horizontal transfers (HTs) in eukaryotes have received much attention recently. Contrary to similar analyses in prokaryotes, most studies in eukaryotes usually investigate particular sequences corresponding to transposable elements (TEs), neglecting the other components of the genome. We present a new methodological framework for the genome-wide detection of all putative horizontally transferred sequences between two species that requires no prior knowledge of the transferred sequences. This method provides a broader picture of HTs in eukaryotes by fully exploiting complete-genome sequence data. In contrast to previous genome-wide approaches, we used a well-defined statistical framework to control for the number of false positives in the results, and we propose two new validation procedures to control for confounding factors. The first validation procedure relies on a comparative analysis with other species of the phylogeny to validate HTs for the nonrepeated sequences detected, whereas the second one built upon the study of the dynamics of the detected TEs. We applied our method to two closely related Drosophila species, Drosophila melanogaster and D. simulans, in which we discovered 10 new HTs in addition to all the HTs previously detected in different studies, which underscores our method's high sensitivity and specificity. Our results favor the hypothesis of multiple independent HTs of TEs while unraveling a small portion of the network of HTs in the Drosophila phylogeny.
منابع مشابه
A genome-wide association study identifies a horizontally transferred bacterial surface adhesin gene associated with antimicrobial resistant strains
Carbapenems are a class of last-resort antibiotics; thus, the increase in bacterial carbapenem-resistance is a serious public health threat. Acinetobacter baumannii is one of the microorganisms that can acquire carbapenem-resistance; it causes severe nosocomial infection, and is notoriously difficult to control in hospitals. Recently, a machine-learning approach was first used to analyze the ge...
متن کاملThe copia retrotransposon and horizontal transfer in Drosophila willistoni.
The copia element is a retrotransposon that is hypothesized to have been horizontally transferred from Drosophila melanogaster to some populations of Drosophila willistoni in Florida. Here we have used PCR and Southern blots to screen for sequences similar to copia element in South American populations of D. willistoni, as well as in strains previously shown to be carriers of the element. We ha...
متن کاملGenome Wide Association Studies, Next Generation Sequencing and Their Application in Animal Breeding and Genetics: A Review
Recently genetic studies have been revolutionized by next generation sequencing (NGS) technology, and it is expected that the use of this technology will largely eliminate defects in the methods of association studies. The NGS technology is becoming the premier tool in genetics. However, at the moment the use of this method is limited especially in the livestock due to high cost and computation...
متن کاملA computational tool for the genomic identification of regions of unusual compositional properties and its utilization in the detection of horizontally transferred sequences.
Similarity Plot (S-plot) is a Windows-based application for large-scale comparisons and 2-dimensional visualization of compositional similarities between genomic sequences. This application combines 2 approaches widely used in genomics: window analysis of statistical characteristics along genomes and dot-plot visual representation. S-plot is effective in identifying highly similar regions betwe...
متن کاملRegions of Unusual Statistical Properties as Tools in the Search for Horizontally Transferred Genes in Escherichia coli
The observed diversity of statistical characteristics along genomic sequences is the result of the influences of a variety of ongoing processes including horizontal gene transfer, gene loss, genome rearrangements, and evolution. The rate at which various processes affect the genome typically varies between different genomic regions. Thus, variations in statistical properties seen in different r...
متن کامل